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Abstract — We show that the network coding and index coding 
problems are equivalent. This equivalence holds in the general 
setting which includes linear and non-linear codes. Specifically, 
we present an efficient reduction that maps a network coding 
instance to an index coding one while preserving feasibility. 
Previous connections were restricted to the linear case. 

I. Introduction 

In the network coding paradigm, a set of source nodes 
transmits information to a set of terminal nodes over a net- 
work while internal nodes of the network may mix received 
information before forwarding it. This mixing (or encoding) 
of information has been extensively studied over the last 
decade (see e.g., (H, 0, (3), (4], Q and references therein). 
While network coding in the multicast setting is currently well 
understood, this is far from being the case for the general 
multi-source multi-terminal setting. In particular, determining 
the capacity of a general network coding instance remains an 
intriguing central open problem, e.g., |6), (7'|, [8|, [9], ifTUl . 

A special instance to the network coding problem intro- 
duced in ifTTl . which has seen significant interest lately, is the 
so-called index coding problem JTl], [12], O, fl4l. lfT31. 
[16|. Roughly speaking, the index coding problem encapsu- 
lates the "broadcast with side information" problem in which 
a single server wishes to communicate with several clients 
each requiring potentially different information and having 
potentially different side information (see Figure [TJa) for an 
example). 

One may consider the index coding problem as a simple 
and representative instance of network coding. The instance 
is "simple" in the sense that any index coding instance can be 
represented as a topologically simple network coding instance 
in which only a single internal node has in-degree greater than 
one and thus only a single internal node can perform encoding 
(see Figure [TJb) for an example). It is "representative" in 
the sense that the index coding paradigm is broad enough to 
characterize the network coding problem under the assumption 
of linear encoding [17|. Specifically, given any instance to 
the network coding problem I, one can efficiently construct 
an instance of the index coding problem I such that: (a) 
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{a} Index coding instance (b) Equivalent network 

Fig. 1. (a) An instance of the index coding problem. A server has 4 binary 
sources X\ , . . . , X4 and there are 4 terminals with different "wants" and 
"has" sets (corresponding to the communication demand and side information 
respectively). How can we satisfy the terminals' demands with a minimum 
number of bits broadcasted by the server? The server can trivially transmit all 
the 4 sources. However, this is not optimal and it is sufficient to broadcast only 
2 bits, namely X\ + X2 + Xz and X\ + X4 ('+' denotes the xor operation), 
(b) Index coding is a special case of the network coding problem. All links 
are of unit capacity (non-specified) or of capacity c. Links directly connecting 
between sources and terminals represent the "has" sets. Any solution to the 
index coding problem with c broadcast bits can be efficiently mapped to a 
solution to the corresponding network coding instance and visa versa. This 
implies that the index coding problem is a special case of the network coding 
problem. The focus of this work is on the opposite assertion. Namely, that 
the network coding problem is a special case of the index coding problem. 

There exists a linear solution to I if and only if there exists 
an optimal linear solution to I, and (b) any optimal linear 
solution to I can be efficiently turned into a linear solution to 
X. All undefined notions above (and those that follow), such 
as "solution," "feasibility," and "capacity", will be given in 
detail in Section [TT1 

The results of jT7] hold for (scalar and vector) linear coding 
functions only, and the analysis there breaks down once one 
allows general coding (which may be non-linear) at internal 
nodes. The study of non-linear coding functions is central to 
the study of network coding as it is shown in ifTSI that non- 
linear codes have an advantage over linear solutions, i.e., that 
there exist instances in which linear codes do not suffice to 
achieve capacity. 

In this work, we extend the equivalence between network 
coding and index coding to the setting of general encoding 
functions (which may be non-linear). Our results imply that, 
effectively, when one wishes to solve a network coding 
instance I, a possible route is to turn the network coding 
instance into an index coding instance I (via our reduction), 



solve the index coding instance I, and turn the solution to 
I into a solution to the original network coding instance I. 
Hence, any efficient scheme to solve index coding will yield 
an efficient scheme for network coding. Stated differently, our 
results imply that an understanding of the solvability of index 
coding instances will imply an understanding of the solvability 
of network coding instances as well. 

The remainder of the paper is structured as follows. In 
Section [TTJ we present the models of network and index 
coding. In Section III we present an example based on the 
"butterfly network" that illustrates our proof techniques. In 



Section IV we present the main technical contribution of this 
work: the equivalence between network and index coding. 
In Section [V] we show a connection between the capacity 
regions of index coding and network coding in networks with 



collocated sources. Finally, in Section VI we conclude with 
some remarks and open problems. 

II. Model 

In what follows we define the model for the network coding 
and index coding problems. Throughout this paper, "hatted" 
variables (e.g., x) will correspond to the variables of index 
coding instances, while "unhatted" variables will correspond 
to the network coding instance. For integers k > 0, we use 
[k] to denote the set {1, . . . , k}. 

A. Network coding 

An instance 1 — (G, S, T, B) of the network coding 
problem includes a directed acyclic network G — (V,E), a 
set of sources nodes S C V, a set of terminal nodes T C V, 
and an \S\ x \T\ requirement matrix B. We assume, without 
loss of generality, that each source s E S has no incoming 
edges and that each terminal t € T has no outgoing edges. 
Let c e denote the capacity of each edge e € E, namely for 
any block length n, each edge e can carry one of the 2 c ° n 
messages in [2 Cc "]. In our setting, each source s € S holds a 
rate R s random variable X s uniformly distributed over [2' R °"']. 
The variables describing different messages are independent. 
We assume that values of the form 2 c " n and 2 RsTi are integers. 

A network code, (T,X) = {(f e ,X e )} U {gt}, is an 
assignment of a pair (f e ,X e ) to each edge e G E and a 
decoding function {g t } to each terminal t E T. For e = (u, v), 
f e is a function taking as input the random variables associated 
with incoming edges to node u, and X e € [2 c " n ] is the random 
variable equal to the evaluation of f e on its inputs. If e is an 
edge leaving a source node s € S, then X s is the input to f e . 
The input to the decoding function g t consists of the random 
variables associated with incoming edges to terminal t. The 
output of g t is required to be a vector of all sources required 
by t. 

Given the acyclic structure of G, the functions {f e } and 
their evaluation {X e } can be defined by induction on the 
topological order of G. Namely, given the family {/ e } one 
can define a function family {/ e }lj such that each f e takes 

In the network coding literature, {/ e } and {fe} are sometimes referred 
to as the local and global encoding functions, respectively. 



as input the source information {X s } and outputs the random 
variable X e . More precisely, for e = (u, v) in which u is a 
source node, define f e = f e . For e = (u,v) in which u is 
an internal node with incoming edges In(e) = {e' l5 . . . ,e' e }, 
define f e = fdfe^ , ■ ■ ■ , fe' e )- Namely, the evaluation of f e on 
source information {X s } equals the evaluation of f e given the 
values of / e ' for e' E In(e). We will use both {f e } and {/ e } 
in our analysis. 

The |5| x \T\ requirement matrix B = [hj] has entries in 
the set {0, 1}, with & s t = 1 if and only if terminal t requires 
information from source s. 

A network code (J 7 , X) is said to satisfy terminal node t 
under transmission (x s : s € S) if the decoding function g t 
outputs (x s : b(s,t) = 1) when (X s : s E S) = (x a : s G S). 
The Network code (J 7 , X) is said to satisfy the instance I 
with error probability e > if the probability that all t € T 
are simultaneously satisfied is at least 1 — e. The probability 
is taken over the joint distribution on random variables (X s : 
s e S). 

For a rate tuple R = (Ri, . . . , R\s\), an instance I to 
the network coding problem is said to be (e, R, n) -feasible 
if there exists a network code (F^X) with block length n 
that satisfies I with error at most e when applied to source 
information (Xi, . . . , X\s\), where X s is uniformly distributed 
over [2 Rs ' n ]. An instance 1 to the network coding problem is 
said to be i?-feasible if for any e > and any 5 > there 
exists a block length n such that I is (e, R(l — 5), n)-feasible. 
Here, R(l-6) = (i?i(l - 5), . . . , R {S \ (1 - 6)). The capacity 
region of an instance 1 refers to all rate tuples R for which 
1 is i?-feasible. 

B. Index coding 

The index coding problem captures the "broadcast with 
side information" problem in which a single server wishes 
to communicate with several clients each having different 
side information. Specifically, an instance to index coding 
includes a set of terminals T — {t%, . . . ,t,f,} and a set of 

sources S = {sx, §2, ■ ■ ■ , available at the server. Given a 
block length n, source s holds a rate Rg random variable Xg 
uniformly distributed over [2 fls " n ] (and independent from other 
sources). Each terminal requires information from a certain 
subset of sources in S. In addition, information from some 
sources in S are available a priori as side information to each 
terminal. Specifically, terminal t E T is associated with sets: 

• Wi is the set of sources required by t, and 

• Hi is the set of sources available at t. 

We refer to and as the "wants" and "has" sets of t, 
respectively. The server uses an error-free broadcast channel 
to transmit information to the terminals. The objective is to 
design an encoding scheme that satisfy the demands of all 
the terminals while minimizing the number of uses of the 
broadcast channel (See Figure [TJ. 

Formally, an instance 1 to the index coding problem is 
the tuple (S,f An index code (£ B ,T>) for T 
with broadcast rate cb, includes an encoding function £b 
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(a) Butterfly Network 



(b) Equivalent Index Coding Instance 



Fig. 2. (a) The butterfly network with two sources X\ and X2 and two 
terminals t\ and <2. (b) The equivalent index coding instance. The server has 
9 sources: one for each source, namely {X\, X2}, and one for each edge in 
the network, namely {X ei , . . . ,X e7 }. There are 7 clients corresponding to 
the 7 edges in the network, 2 clients corresponding to the two terminals of 
the butterfly network and one extra terminal t a n. 

for the broadcast channel, and a set of decoding functions 
T) = {Dtl^f with one function for each terminal. The 
function <5b is a function that takes as input the source 
random variables {Xg} and outputs a rate eg random variable 
Xb £ [2 Cb ™]. The input to the decoding function _D ( - consists 
of the random variables in 22 t - (the source random variables 
available to t) and the broadcast message Xb- The output of 
Df is intended to be a vector of all sources in W{ required by 
t. 

An index code (£5,2?) of broadcast rate cb is said to satisfy 
terminal t under transmission (x$ : s £ S) if the decoding 
function £> t - outputs (xg : s £ Wt) when (X§ : s £ S) = 
(x$ : s £ S). Index code (£b,T>) is said to satisfy instance I 
with error probability e > if the probability that all t £ T 
are simultaneously satisfied is at least 1 — e. The probability is 
taken over the joint distribution on random variables { X g} - g g. 

For a rate tuple R — (R\, . . . , R§i) and broadcast rate 
cb, an instance I to the index coding problem is said to be 
(e, R, cb, n)-feasible if there exists an index code (£3, 2?) 
with broadcast rate cb and block length n that satisfies 1 with 
error at most e when applied to source information (Xg : s £ 
S) where each X$ is uniformly and independently distributed 
over [2- R * n ]. An instance 1 to the network coding problem is 
said to be (R, cs)-feasible if for any e > and 5 > there 
exists a block length n such that 1 is (e, R(l — 6),CB,n)- 
feasible. As before, R(l-S) = (Ri(l - S), . . . , R,§, (1 - 5)). 
The capacity region of an instance I refers to all rate tuples 
R and capacities cb for which I is (R, cb) -feasible. 

III. Example 

Our main result states that the network coding and index 
coding problems are equivalent (for linear and non-linear 
coding). Theorem [T] in Section IV gives a formal statement of 
this result. The proof is based on a reduction that constructs for 
any given network coding problem an equivalent index coding 
problem. In this section, we explain the main elements of our 
proof by applying it to the butterfly network (T) example in 



Fig. |2|a). For simplicity, our example does not consider any 
error in communication. Our reduction goes along the lines of 
the construction in iflTl . while our analysis differs to capture 
the case of non-linear encoding. 

We start by briefly describing the butterfly network depicted 
in Fig. |2ja). The network has two information sources s% and 
S2 that hold two uniform i.i.d binary random variables X\ and 
X2, respectively. There are also two terminals (destinations) t\ 
and t 2 that want X 1 and X 2 , respectively. All the edges in the 
network, c \, . . . , e-j, have capacity 1. Following the notation 
in Section II-A let f et (X±,X2) be the one-bit message on 



edge Ci. Then, the following is a network code that satisfies 
the demands of the terminals: 



Jei = fe 2 = Xi 
fe 3 = fa = X 2 
/e 5 = fee = fe 7 = Xi + X 2 , 



(1) 



where '+' denotes the xor operation. Terminal t\ can decode 
X\ by computing Xi = f e4 + f e7 , and t 2 can decode X 2 
by computing X 2 = f ei + f efj . Thus, the butterfly network is 
(e,R,l) = (0, (1, 1), l)-feasible. 

The problem now is to construct an index coding instance 
that is "equivalent" to the butterfly network, i.e., any index 
code for that instance would imply a network code for the 
butterfly network, and vice versa. We propose the following 
construction, based on that presented in [17|, in which the 
server has 9 sources split into two sets (see Figure E). 

• X\ and X 2 corresponding to the two sources Xi and X 2 
in the butterfly network. 

• X ei , . . . , X er corresponding to the edges c\,...,c-j in 
the butterfly network. 

There are 10 clients, as described in the Table in Fig. [2jb). 
These clients are split into 3 sets: 

• A client t ei for each edge ej. Client t ef wants X ei and 
has the variables X ej for each edge ej in the butterfly 
network that is incoming to e^. 

• A client tj for each network terminal f^. ti wants X{ and 
has the variables X ej for each edge ej in the butterfly 
network that is incoming to tj. Namely t\ wants X\ and 
has X ei and X e7 , whereas t 2 wants X 2 and has X ei and 
X e6 . 

• One client i a ii that wants all variables that correspond 
to edges of the butterfly network (i.e., X ei X er ) and 
has all variables that correspond to sources of the butterfly 
network (i.e., Xi and X 2 ). 

Next, we explain how the solutions are mapped between 
these two instances. While "Direction 1" strongly follows the 
analysis appearing in ifTTl . our major novelty is in "Direction 
2" (both proof directions are presented below for completion). 

Direction 1: Network code to index code. Suppose we 
are given a network code with local encoding functions f ei , 
and global encoding functions f e . (Xi,X 2 ),i = 1, . . . , 7. We 
construct the following index code solution in which the server 
broadcasts the 7-bit vector Xb = (Xs(ei), . . . , Xb{^)), 



where 



X B {e i )=X ei +f et (X 1 ,X 2 ), i = l,...,7. (2) 



For instance, the index code corresponding to the network code 
in ([1} is 

X B {ex)=X ei +Xx 
X B (e 2 ) = X e2 + Xi 
X B (e 3 )=X e3 +X 2 

X B ( ei )=X ei +X 2 (3) 
X B (e 5 ) = X e5 +X 1 +X 2 
X B (e 6 )=X ee +X 1 +X 2 
X B (e 7 )=X er +X 1 +X 2 . 

One can check that this index code allows each client to 
recover the sources in its "wants" set using the broadcast X B 
and the information in its "has" set. For example, in the case 
of the index code in (J3), terminal t e5 computes 



X„ 



X B (e 5 ) - (X B (e 2 ) -X e2 )- (X B (e 3 ) ~ X ea ). 



Here, both '+' and '-' denote the xor operation. More specif- 
ically, terminal t e& first computes f e > for its incoming edges 
via its "has" set and X B (i.e., f e2 — X B (e 2 ) — X e2 and 
f e3 = X B (e 3 ) — X e3 ). Then using the fact that 

fe 5 (Xi,X 2 ) = fe B ( y fe'(X 1 ,X 2 )\e' is an incoming edge of e 5 J , 

terminal i e5 can compute f es (Xi, X 2 ). Finally, by the defi- 
nition of X B in |2) terminal t e5 recovers X es — X B (e§) — 
f e5 (Xi,X 2 ). By a similar process, every terminal in the index 
coding instance can decode the sources it wants. 

Direction 2: Index code to network code. Let c B equal the 
total capacity of edges in the butterfly network, i.e., c B = 7. 
Suppose we are given an index code with broadcast rate c B 
that allows each client to decode the sources it requires (with 
no errors). We want to show that any such code can be mapped 
to a network code for the butterfly network. Let us denote by 
X B = (X Bi i, . . . , X B> ?) the broadcast information where X B 
is a function, possibly non-linear, of the 9 sources available at 
the server X\ , X 2 and X ei , . . . , X er ■ 

For every client t, there exists a decoding function Df that 
takes as input the broadcast information X B and the sources 
in its "has" set and outputs the sources it requires. Namely, 
we have the following functions: 

Di ei (XB,Xi) =X ei D^{Xb,X\) = X e2 

D t e3 (X B ,X 2 ) = Xe 3 Df^ (X B ,X 2 ) = X e4 

-°t" (Xb , X e2 , X £3 ) = X es Df (X B , X es ) = X e6 (4) 

c 5 6 

Df (Xb , X e5 ) = X ev Df ± (Xb , X e4 , X ev ) = X\ 
Df 2 (X B ,X ei ,X ee ) = X 2 _D t - aii (X s ,Xi,X 2 ) 

— (X et , . . . , X e7 ). 

We will use these decoding functions to construct the network 
code for the butterfly network. Consider for example edge e$. 
Its incoming edges are e 2 and e%, so we need to define a local 



encoding f es which is a function of the information X er> and 
X e3 they are carrying. In our approach, we fix a specific value 
a for X B , and define 

fe 5 (X e2 ,X ea ) = Df (a,X e2 ,X e ). 

Similarly, we define the encoding functions for every edge in 
the butterfly network, and the decoding functions for the two 
terminals t\ and t 2 . The crux of our proof lies in showing that 
there exists a value of a for which the corresponding network 
code allows correct decoding. In the example at hand, one may 
choose a to be the all zero vector (or actually any vector 
for that matter). The resulting network code is: 



/<* = D l 

fe 3 = D; 
fe B - Df 
fei = Df, 

Terminals t\ 



„(0,*a) 

_ (0) fe 2 i fez) 

5 

,(0,/e 5 ). 



fei 

f<: 1 

fe 6 



~ Df e 
= D t 



( (0,X 2 ) 

,(0,/e 5 ) 



(5) 



functions 



£> ?i (0,/e 4) / e 



and t 2 can decode using the 
) and D f 2 (0, f ei , /e 8 ), respectively. 
To prove correct decoding, we show that for any fixed values 
of Xi and X 2 , there exists a unique value for the vector 
(X ei , . . . , X e7 ) that corresponds to X B = 0. Otherwise, it can 
be seen that t a u cannot decode correctly since c B = 7 and X B 



is a function of X\ , X 2 and X e 



, X eT . Roughly speaking, 



this correspondence allows us to reduce the analysis of correct 
decoding in the resulting network code, to correct decoding in 
the original index code. Full details of this reduction, and on 



how to choose a appear in the upcoming Section IV 



IV. Main Result 



We follow the proof of 11711 to obtain our main result. 

Theorem 1: For any instance to the network coding prob- 
lem I one can efficiently construct an instance to the index 
coding problem I and an integer c B such that for any rate 
tuple R, any integer n, and any e > it holds that I is 
(e,R,n) feasible iff I is (e, R,c B ,n) feasible. Here, the 
rate vector R for 1 can be efficiently computed from R; 
and the corresponding network and index codes that imply 
feasibility in the reduction can be efficiently constructed from 
one another. 

Proof: Let G = (V, E), and I = (G, S, T, B). Let n be 
any integer, and let R = (Ri, . . . , i?|s|). We start by defining 
X = (S,f, {Wt\,{H { }), the integer c B , and the rate tuple 
R. See Figure u\ for an example. To simplify notation, we 
use the notation Xg to denote both the source s € S and 
the corresponding random variable. For e = (u, v) in E let 
In(e) be the set of edges entering u in G. If u is a source s 
let In(e) — {s}. For ti £ T, let In(ti) be the set of edges 
entering ti in G. 

• S consists of |5| + \E\ sources: one source denoted X s 
for each original source s in I and one source denoted X e 
for each edge e in G. Namely, § — {X s } seS L){X e } e( z E . 

• T consists of \E\ + \T\ + 1 terminals: \E\ terminals 
denoted t e corresponding to the edges in E, \T\ terminals 



denoted £j corresponding to the terminals in X, and a 
single terminal denoted t a u- Namely, T = {t e } ee E U 
Oi}ie[|T|] U 

• For £ e G T we set 
{X e }. 

• For € T, let £j be the corresponding terminal in T. We 



{X e ,} e : eIn{e) and W fe 



set ffi 



{X e ,} e , eIn(ti) and W f . 



s:6(s, *;) = !■ 



For t a u set ff t - a;j = {X s } seS and W iaii = {X e } eeE . 
Let R be a vector of length \S\ + \E\ consisting of two 
parts: (R s : s G S) represents the rate R s of each X 8 
and (R e : e G E) represents the rate R e of X e . Set 
R s = R s for each s E S and R e — c e for each e G E. 
(Here i? s is the entry corresponding to s in the tuple R, 
and c e is the capacity of the edge e in G.) 
Set c B to be equal to J^eeE c e = S 



We now present the two directions of our proof. The fact 
that I is (e,R,n) feasible implies that I is (e, R,CB,n) 
feasible was essentially shown in ifTTll and is presented here 
for completeness. The other direction is the major technical 
contribution of this work. 

1 is (e,R,n) feasible implies that I is (e, R,c B ,n): 

For this direction we assume the existence of a network 
code (F, X) = {(f e ,X e )} U {g t } for 1 which is (e,R,n) 
feasible. As mentioned in Section|I] given the acyclic structure 
of G, one may define a new set of functions f e with input 
{X s } se s such that the evaluation of f e is identical to the 
evaluation of f e , which is X e . We construct an index code 
(£b,T>) = (£B,{D t }) for X. We do this by specifying the 
broadcast encoding £b and the decoding functions {£> t -} t ~ g j,. 

The function £ b will be defined in chunks, with one chunk 
(of support size [2 Cc ™]) for each edge e G E denoted £_e(e). 
We denote the output of £b (e) by Xb (e) and the output of £b 
by the concatenation Xb of the output chunks ^(e). In what 
follows we use 'a+b' as the bitwise xor operator between equal 
length bit vectors a and b. For each e 6 E, the corresponding 
chunk in £s(e) will be equal to X e + f e (Xi, . . . ,X\s\)- It 
follows that £b is a function from the source random variables 
of X to Xb with support 



2^ 



, R.n 



[2 S 



We now set the decoding functions: 

• For t e in T we set Z? t ~ to be the function defined by the 
following decoding scheme: 

- First, for each e' G In(e), using the information 
in Hf , the decoder computes X B (e') + X e > = 

fe'(Xi, ■ ■ ■ ,X\s\)+X e i +X e > = f e '(Xi, . . . 

- Then, let In(e) = {e' 11 . . . , e' e }. Using the function 
/ e from network code (J 7 , X), the decoder computes 

feife'^Xi, . . . ,X\s\) : ■ ■ • , fe' e (Xl, . . . ,X\s\))- 

By definition of f e this is exactly f e (Xi, . . . , Xigi). 

- Finally, compute 

Xaie) + / e (ii, . . -,X\ S \) = X e , 



which is the source information client t e wants in X. 

• For tj G T the process is almost identical to that above. 
Let ti be the corresponding terminal in T. The function 
g ti is used on the evaluations of f e > for e' G In(ti), 
and the outcome is exactly the set of sources W- t . = 
{X s } S :b(s,u)=i wanted by U. 

. For tail, recall that H iau = {X s } seS and W{ all = 
{X e J ee E- To obtain X e the decoder evaluates Xs(e) + 
fe(Xi, . . . ,X\ S \). 

Let £ > 0. We now show that if the network code (J 7 , X) 
succeeds with probability 1 — e on network X (over the sources 
{X s } s6 s of rate tuple R), then the corresponding index code 
also succeeds with probability 1 — e over X with sources {X} 
of rate tuple R. 

Consider any realization x = {x s } of source information 
{X s } of the given network coding instance X for which all 
terminals of the network code decode successfully. Denote a 
realization of source information {Xg} Se § in X by (x s ,x e ), 
where x s corresponds to the sources {X s } se s and x e cor- 
responds to sources {X e } eG E- Let x s (s) be the entry in x s 
corresponding to source X s for s G S, and let x e (e) be the 
entry in x e corresponding to source X e for e G E. Consider a 
source realization (x s , x e ) in X "corresponding" to x = {x s }\ 
namely, for s G S set x s (s) = x s and set x e to be any 
complementary source realization. 

For source realization x of X, let x e be the realization of 
X e transmitted on edge e in the execution of the network 
code {F,X). By our definitions, it holds that for any edge 
e G E, / e (x s ) = / e (x) = x e . It follows that the realization 
of X B (e) = X e + f e {X u ...,X\ S \) is x e (e) + / e (x s ) = 
x e (e) + x e . In addition, as we are assuming correct decoding 
on x, for each terminal fcj G T of X it holds that gi (x e / : e' G 
In(ti)) = (x s : b(s,ti) = l). 

Consider a terminal t e in I, The decoding procedure 
of t e first computes for e' G In(e) the realization of 
X B (e') + X e i which by the discussion above is exactly 
x e (e') + x e / + x e (e') = x e /. Then the decoder computes 
/ e (x e / : e' € In(ej) = / e (x) = J e (x s ) =_x e . Finally, the 
decoder computes the realization of Xs{e)+f e {Xi, . . . , Xigi) 
which is x e (e) + x e + x e = x e (e) which is exactly the 
information that the decoder needs. 

Similarly, consider a terminal t t in X corresponding to a 
terminal ti G T of X. The decoding procedure of ti first 
computes for e' G In(ti) the realization of X B (e') + X e > 
which by the discussion above is exactly x e /. Then the decoder 
computes g,(x e / : e' G In(ti)) — (x s : b(s,ti) — 1), which 
is exactly the information needed by £j. 

Finally, consider the terminal t a ii- The decoding procedure 
of t a u computes for each e G E the realization of Xg(e) + 
f e (Xt,...,X\ s \) which is x e (e) + x e + x e = x e (e) which 
again is exactly the information needed by i a u. 

All in all, we conclude that all terminals of X decode cor- 
rectly on source realization (x s ,x e ) corresponding to source 
realization x of X which allows correct decoding in X. This 
implies that the instance X is indeed (e, R, cb, n) feasible. 



T is (e, R, c B ,n) feasible implies that 1 is (e, R, n) feasible: 

Here, we assume that X is (e, R, cb, n) feasible with cb as 
defined above. Thus, there exists an index code (£3, 2?) = 
(£b, {D{\) for I with block length n and success probability 
at least 1 — e. In what follows we obtain a network code 
(J-, X) — {(/ e , X e )}U{gt} for X, The key observation we use 
is that by our definition of cb — See£ tne su PP ort [2 Csn ] 
of the encoding £b is exactly the size of the (product of) the 
supports of the source variables {X e } in I. The implications 
of this observation are described below. 

We start with some notation. For each realization x s = 
{^sjsgs °f source information {X s } in I, let A% s be the re- 
alizations x e = {x e } e& E of {X e } eG E for which all terminals 
decode (x s ,x e ) correctly. That is, if we use the term "good" 
to refer to any source realization pair (x s ,x e ) for which all 
terminals decode correctly (X s ,X e ) = (x s ,x e ), then 

Az B = {x e I the pair (x s ,x e ) is good}. 

Claim 1: For any given a £ [2 CB "] and any x s , there is at 
most one x e £ A% B for which fs(x s ,x e ) = a. 

Proof: Let x s = {x s } se s be a realization of the source 
information {X s } s( zs- Treat the broadcasted value £b(x s , x e ) 
as a function of x e . Namely, set £ _b(x s , X e ) ~ £z B (X e ). 

Now, for any x s and any x e £ A% b , it holds that terminal 
t a u will decode correctly given the realization of the "has" 
set Hi u = x s and the broadcasted information X B via £b- 
Namely, Z) t - a (£ Xs (x e ),x s ) = x e . We now show (by means 
of contradiction) that the function £ Xs (x e ) obtains different 
values for different x e £ A% B . This will suffice to prove our 
assertion. 

Suppose that there are two values x e 7^ x.' e in A% B 
such that £ Xs (x e ) = £z B (x' e ). This implies that x e = 
D ian (£z B (Z e ),x s ) = \„(£x s (Xg),x s ) = x^,, which gives a 
contradiction. ■ 

Claim 2: There exists a a £ [2 Cb ™] such that at least a 
(1 — e) fraction of source realizations x s satisfy £b(% s , x e ) = 
a for some x e E A Xs . 

Proof: Consider a random value a chosen uniformly 
from [2 CB ™]. For any partial source realization x s , the prob- 
ability that there exists a realization x e e A Xs for which 
^(xjce) = a is at least |A x J/2 fiBn . This follows by 
Claim 111 since for every x e G A Xs it holds that £ , B(x s ,x e ) is 
distinct. Hence, the expected number of source realizations 
x s for which there exists a realization x e € A Xs with 
£ , s(x s ,x e ) = a is at least 

Ex s l^xj > (l- £ )2"E. e s*-+S^> 



2cb« — 2 CB ™ 
We use here the fact that the total number of source realiza- 
tions (x s ,x e ) for which the index code (£b,2?) succeeds is 
exactly ^ x | A Xs |, which by the e error assumption is at least 

(1 _ £ )2™E«es-R»+i:eeB-Re)_ 

Since c B = Y^eeE^e, 



(1 _ e )2«(£ ses 

2c B n 



which, in turn, is exactly the size of a (1 — e) fraction of all 
partial source realizations x s . 

We conclude that there is a a e [2 CB?l ] which "behaves" at 
least as well as expected, namely a value of a that satisfies 
the requirements in the assertion. ■ 

We will now define the encoding functions of (J 7 , X) for 
the network code instance X, Specifically, we need to define 
the encoding functions {/ e } and the decoding functions {g t } 
for the edges e in E and terminals t in the terminal set T of 
I. We start by formally defining the functions. We then prove 
that they are an (e, R, n) feasible network code for I. 

Let a be the value specified in Claim [2] let A a be the set 
of partial realizations x s for which there exists a realization 



x e € Az B with £ B (x s ,x e ) 
is at least (1 - e)2 n GC«es ^ 
For e e S let 



a. By Claim pi the size of A 

= (1 - £ )2 n &£sRs). 



fe 



2"Ee' S 7n(e)' 



[2" 



be the function that takes as input the random variables (X e i : 
e' e Zn(e)) and outputs X e = D$ (a, {X e < : e' e /n(e))). 
Here, we consider X e i for e' £ £ to be a random variable of 
support [2 c =' Tl ]. 

For terminals U € T in I let 



.9/, 



(l-e)2 n CC. 6fl A.) j 



be the function that takes as input the random variables (X e > : 
e' £ In(U)) and outputs Dj. (a, (X e * : e' £ In(U))). 

We will now show that the network code defined above 
decodes correctly with probability 1— e. Consider any rate R = 
. . . j-Risi) realization of the source information in I: x = 
{x s }. Consider the source information x s of I corresponding 
to x, namely let x s = x. Assume that x s £ A a . Using Claim|2] 
let x e be the realization of source information {X e } in I for 
which £s(x s ,x e ) = a. Recall that, by our definitions, all 
terminals of I will decode correctly given source realization 
(x s ,x e ). For s £ S, let x s (s) = x s be the entry in x s that 
corresponds to X s . For e £ E, let x e (e) be the entry in x e 
that corresponds to X e . 

We show by induction on the topological order of G 
that for source information x the evaluation of f e in the 
network code above results in the value x e which is equal 
to x e (e). For the base case, consider an edge e = (u,v) 
in which u is a source with no incoming edges. In that 
case, by our definitions, the information x e on edge e equals 
f e (x s ) = t) ic (a,x s ) = D t - e (£ B (x s ,Xe),x s (s)) = x e (e). 
Here, the last equality follows from the fact that the index code 
(£B,i > ) succeeds on source realization (x s ,x e ), and thus all 
terminals (and, in particular, terminal t e ) decode correctly. 

In general, consider an edge e = (u, v) with incoming edges 
e' £ In(e). In that case, by our definitions, the information x e 
on edge e equals f e (x e > '■ e' £ In(e)). However, by induction, 
each x e i for which e' £ In{e) satisfies x e > — x e (e'). Thus 
x e = £»t c (cr, {x e , : e' £ In(e))) = D {e (£ B (x s ,-k e ), (x e (e') : 
e' £ In(e))) = x e (e). As before, the last equality follows 



from the fact that the index code (£3 , 2?) succeeds on source 
realization (x s ,x e ). 

Finally, we address the value of the decoding functions 
gt- By definition, the outcome of gt is exactly Df. (a, (x e i : 
e' e Info))) = 4- i (^B(x s ,x e ),(x e (e') : e' e In(t t ))) = 
(x s (s) : b(s,ti) = 1) = (x s : b(s,ti) = 1). As before, 
we use the inductive argument stating that x e > — x e (e'), 
and the fact that the index code (£b,T>) succeeds on source 
realization (x s ,x e ), and thus all terminals (and, in particular, 
terminal f,*) decode correctly. This suffices to show that the 
proposed network code (J 7 , X) succeeds with probability 1 — e 
on source input of rate tuple R. We have presented correct 
decoding for I when x = x s € A a , and shown that 
\A a \ > (1 — e)2 n ^*es R »). Therefore, we have shown correct 
decoding for I with probability at least (1 — e). ■ 

V. Capacity regions 

In certain cases, our connection between network and index 
coding presented in Theorem [T] implies a tool for determining 
the network coding capacity via the capacity of index coding 
instances. Below, we present such a connection in the case 
of collocated sources (i.e., for network coding instances in 
which all the sources are collocated at a single node in the 
network). Similar results can be obtained for "super source" 
networks (studied in, e.g., [19], 11201 ). We discuss general 
network coding instances in Section [VI] 

Corollary 1: For any instance to the network coding prob- 
lem 1 where all sources are collocated, one can efficiently 
construct an instance to the index coding problem I and an 
integer cb such that for any rate tuple R: R is in the capacity 
region of I iff (R,Cb) is in the capacity region of I, Here, 
the rate vector R for I can be efficiently constructed from R. 

Proof: Let 1 be an instance to the network coding 
problem and let R be any rate tuple. The instance I, the rate 
tuple R and the integer cb are obtained exactly as presented 
in Theorem [T] We now show that any R is in the capacity 
region of 1 iff (R, cb) is in the capacity region of X. 

From network coding to index coding: Suppose that R 
is in the capacity region of the network coding instance 
I. Namely, for any e > 0, any 6 > 0, and source rates 
R(l - S) = (i?i(l - 5),...,R\ S \(1 ~ <0)> there exists a 
network code with a certain block length n that satisfies I 
with error probability e. As shown in the proof of the first 
direction of Theorem [T[ this network code can be efficiently 
mapped to an index code for I of block length n, broad- 
cast rate equal to cen, error probability e and source rates 
Rs = ({R s (l - 6)} seS ,{R e } eeE )- Therefore, for any e > 
and any 6 > 0, there exists a block length n for which 1 
is (e,Rs,CB 7 n) feasible, and thus (R,cb) is in the capacity 
region of I. 

From index coding to network coding: Suppose that 
(R, cb) is in the capacity region of I. Recall that R = 

{{R s } se s, {R e }eeE) and c B = E eS _B R e- Therefore, for any 
e > and any S > there exists an index code with a 
certain block length n and error probability e such that I 



is (e,R(l — S), cb, ra)-feasible. Note that we cannot readily 
use the proof of the second direction of Theorem [T] to map 
this index code into an network code for I. That is because 
this map requires that cb be equal to the sum of rates of 
random variables in I that correspond to edges in E, namely 
that £b = (1 — S) J2eeE ^e- However, in our setting we have 
cb = J^eeE^e- This (small) slackness will not allow the 
proof of Theorem [T] to go through. Instead, we proceed by 
first stating a claim similar to Claim |2j which will allow us to 
prove our results in the setting in which all sources in 1 are 
collocated. The proof of Claim [3] appears at the end of this 
section. Throughout, we use the notation set in the proof of 
Theorem Q] 

Claim 3: There exists a set S C [2 Cfin ] of cardinality 



|E|=nlog(4/3)(l 



6)(J2Rs)2 nSi: ° eERe 
ses 



such that least a (1 — 2e) fraction of source realizations x s 
satisfy £ , s(x s ,x e ) = a for some x e £ A% B and some a € S. 

Assuming Claim [3] we will now define the encoding and 
decoding functions for the network coding instance X. Suppose 
that all the sources s € S are collocated at a single node that 
we call the source node. For each source realization x s , the 
source node checks whether there exists x e e Ax s and some 
<j Xs € E such that f , B(x s ,x e ) = cr Xs (Case A) or not (Case 
B). 

In Case A, the network code operates in two phases. During 
the first phase, the source node sends an overhead message to 
all the nodes in the networlr revealing the value of cr Xs using 
at most log |E| bits (by Claim 3k The rate needed to convey the 
overhead message vanishes for arbitrarily small values of 5. In 
the second phase, we implement the network code described in 
the second direction of the proof of Theorem[T](with a = £r Xs )- 
The source x s is transmitted through the network by sending 
on edge e the message X e — Z? t - (<r Xa , (X e > : e' € Jn(e))) 
and each terminal implementing the decoding function 
Df (cr Xs ,(X e / : e' £ In(ti))). The total block length used 
is ri (1 + 5') for 5' = log|E|/n that tends to zero as 5 
tends to zero. It is not hard to verify (based on the proof of 
Theorem [TJ that such encoding/decoding functions for I will 
allow successful decoding when the source realization for I 
is x s . 

In Case B, we allow the network to operate arbitrarily, 
and consider this case as an error. Claim [3] implies that 
Case B will happen with probability at most 2e. Therefore, 
for R = (R Sl) . . . , R S i s i ) the network coding instance I is 
(2e, -"^"f 3 , n(l + 5')) feasible for *' = log|E|/n. As 2e 
tends to zero with e tending to zero, and similarly 5' tends 
to zero as 6 tends to zero, we conclude that 1 is ^-feasible. 



We now present the proof of Claim [3] 

Proof: (Claim [3]) Consider the elements x s for which 
| A* J is at least of size 2 n( - 1 - 5 ^ B ' 1 . Recall that c B = 

2 Any node that cannot be reached by a directed path from the source node 
can be set to remain inactive (not transmit any message) without altering the 
capacity region of the network. 



SeeB ^e- Denote these elements x s by the set A. Notice 
that 

\A\ > (l-2e)-2 n( - 1 -V'Z°zs A >. 

Otherwise the total error in the index code we are considering 
is greater than e, which is a contradiction to our assumption. 

Let £' be a subset of [2 n&B ] of cardinality |S'| = 2 Sn£s 
chosen uniformly at random (i.e., each element of £' is i.i.d. 
uniform from [2" £b ]). For x s e [2 n V-- & ) S»es define the 
binary random variable Z^, such that Z^ = 1 whenever there 
exist x e e Ax s and cr e £' such that ^(x^Xe) = cr, and 
Z% B = otherwise. 

Using Claim [T] we have for any x s e yl that 

|E'| 

Pr(Z Xs = 1) = 1 - ( 1 



>1- 1- 



1^.1 

2«(1 — <5)cb — 1 



2 nc B 



= 1-1- 



1 



2 £' 



|S'| 



> 



We say that the x s e 4 is covered by £' if Zx 9 = 1. It suffices 
to cover all x s e A in order to satisfy our assertion. 

In expectation, £' covers at least - of the elements in A. 
Using the same averaging argument as in Claim [2] it follows 
that there exists a choice for the set £' that covers | of the 
elements in A. By removing these covered values of x s and 
repeating on the remaining elements in A in a similar manner 
iteratively, we can cover all the elements of A. Specifically, 
iterating log \A\/ log (4/3) times (each time with a new £') 
it is not hard to verify that all elements of A will eventually 
be covered. Taking £ to be the union of all obtained in 
iteration i, we conclude our assertion. ■ 

VI. Conclusions 

In this work, we addressed the equivalence between the 
network and index coding paradigms. Following the line of 
proof presented in [ 17 1 for a restricted equivalence in the case 
of linear encoding, we present an equivalence for general (not 
necessarily linear) encoding functions. Our results show that 
the study and understanding of the index coding paradigm 
implies a corresponding understanding of the network coding 
paradigm. 

Although our connection between network and index coding 
is very general it does not directly imply a tool for determining 
the network coding capacity region as defined in Section[TT]for 
general network coding instances. Indeed, as mentioned in the 
proof of Corollary [T] for collocated sources, a naive attempt 
to reduce the problem of determining whether a certain rate 
vector R is in the capacity region of a network coding instance 
I to the problem of determining whether a corresponding rate 
vector R is in the capacity region of an index coding instance 
I, shows that a stronger, more robust connection between 
index and network coding is needed. A connection which 
allows some flexibility in the value of the broadcast rate cb- 
Such a connection is subject to future studies. 



Recently, it has been shown [20 1, [21| that certain intriguing 
open questions in the context of network coding are well 
understood in the context of index coding (or the so-called 
"super-source" setting of network coding). These include the 
question of "zero-vs-e error: "What is the maximum loss in 
rate when insisting on zero error communication as opposed 
to vanishing decoding error?" fl9l , EOlk the "edge removal" 
question: "What is the maximum loss in communication rate 
experienced from removing an edge of capacity 6 > from 
a given network?" It22l . ||231 : and the following question 
regarding the independence of source information: "What is 
the maximum loss in rate when comparing the communication 
of source information that is "almost" independent to that of 
independent source information?" [21 1. 

At first, it may seem that the equivalence presented in this 
work implies a full understanding of the open questions above 
in the context of network coding. Although this may be the 
case, a naive attempt to use our results with those presented 
in EOl . ET1 again shows the need of a stronger connection 
between index and network coding that (as above) allows some 
flexibility in the value of Cb- 
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